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November 2004 Proceedings of the 2nd international conference on Embedded 
networked sensor systems 

Full text available: 'gpdf(291.68 KB) Additional Information: full citation , abstract, references , index terms 

An efficient and reliable file storage system is Important to micro sensor nodes so that data 
can be logged for later asynchronous delivery across a multi-hop wireliess sensor network. 
Designing and implementing such a file system for a sensor node faces various challenges. 
Sensor nodes are highly resource constrained in terms of limited runtime memory, limited 
persistent storage, and finite energy. Also, the flash storage medium on sensor nodes 
differs in a variety of ways frorn the traditiona ... 

Keywords: eeprom, file system, flash, log structured, reliability, sensor 



^ Enhancing software reliability with speculative threads 
Jeffrey Opiinger, Monica S. Lam 

October 2002 Proceedings of the 10th international conference oh Architectural 

support for programming languages and operating systems, volume 37 , 36 

30 Issue 10 , 5 , 5 

Full text available: ^£df{l,47 MB) Additional Information: f ull citation , a b stract , r eferences , citing s 

This paper advocates the use of a monitor-and-recover programming paradigm to enhance 
the reliability of software, and proposes an architectural design that allows software and 
hardware to cooperate in mal<ing this paradigm more efficient and easier to program. We 
propose-that programmers write monitoring functions assuming simple sequential execution 
semantics. Our architecture speeds up the computation by executing the monitoring 
functions speculatively in parallel with the main computation. For ... 

^ Embra: fast and flexible machine simulation 
Emmett Witchel, Mendel Rosenblum 

May 1996 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 1996 
ACM SIGMETRICS international conference on Measurement and modeling 
of computer systems, volume 24 issue i 

Full text available: ' Ppdf(1.83 MB) Additional Information: full citation, abstract , references , citings , index 




Results (page 1 ): +rewrite +cycle +count static file dynamic flash Page 2 of 6 



of uniprocessors and cache-coherent multiprocessors. When running as part of the SimOS 
simulation environnnent, Embra models the processors of a MIPS R3000/R4000 machine 
faithfully enough to run a commercial operating system and arbitrary user applications. To 
achieve high simulation speed, Embra uses dynamic binary translation to generate code 
sequences which simulate the workload. It is the first machine simu ... 

* System s upport for pervasive applications 

Robert Grimm, Janet Davis, Eric Lemar, Adam Macbeth, Steven Swanson, Thomas Anderson, 

Brian Bershad, Gaetano Borriello, Steven Gribble, David Wetheralj 

November 2004 ACM Transactions on Computer Systems (TOCS), volume 22 issue 4 

Full text available: pdf(1 .82 M B) Additional Information: full citation, abstract , referen ces, index ter ms 

Pervasive corhputing provides an attractive vision for the future of computing. 
Computational power will be available everywhere. Mobile and stationary devices will 
dynamically connect and coordinate to seamlessly help people in accomplishing their tasks. 
For this vision to become a reality, developers must build applications that constantly adapt 
to a highly dynamic computing environment. To make the developers' task feasible, we 
present a system architecture for pervasive computing, called & ... 

Keywords: Asynchronous events, checkpointing, discovery, logic/operation pattern, 
migration, one.world, pervasive computing, structured I/O, tuples, ubiquitous computing 



Continuous profiling: where have all tlie cy cles g one? 

Jennifer M. Anderson, Lance M. Berc, Jeffrey Dean, Sanjay Ghemawat, Monika R. Henzinger, 
Shun-Tak A. Leung, Richard L. Sites, MarkT. Vandevoorde, Carl A. Waldspurger, William E. 
Weihl 

October 1997 ACM SIGOPS Operating Systems Review , Proceedings of tlie sixteentli 

ACM symposium on Operating systems principles, volume 3i issue 5 
Full text available: ^bM 2.29 MB) Additional Information: fu l l citati on , references , citinos . ind ex term s 




® F ast detection of communication patterns in distributed execution s j 
Thomas Kunz, Michlel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of tlie Centre for Advanced Studies 
oh Collaborative research 

Full text available: g:Bdi(4,21_.MS) Additional Information: full cit a tio n, ab strac t, re ferenc es , index ter ms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

^ Session summaries from the 17th svmposium on operating systems princip le i 
(SOSP'99) ■ 
Jay Lepreau, Eric Eide 

April 2000 ACM SIGOPS Operating Systems Review, Volume 34 Issue 2 
Full text available: gfidf(3J5JVIB} Additional Information: full citation , index terms 
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Shun-Tak A. Leung, Richard L. Sites, Mark T. Vandevoorde, Carl A. Waldspurger, William E. 
Wei hi 

November 1997 ACM Transactions on Computer Systems (TOCS), volume 15 issue 4 

Full text available- ^ pdf(259 35 KB) Additional Information: ful l citation , abstract , references, ci t ing s , index 
• yy . t erm s 

This article describes the Digital Continuous Profiling Infrastructure, a sampling-based 
profiling system designed to run continuously on production systems. The system supports 
multiprocessors, works on unmodified executables, and collects profiles for entire systems, 
including user programs, shared libraries, and the operating system kernel. Samples are 
collected at a high rate (over 5200 samples/sec. per 333MHz processor), yet with low 
overhead (1-3% slowdown for most workloads). A ... 

Keywords: performance understanding, performance-monitoring hardware, profiling, 
program analysis 



^ F ine -grain access c on trol for dis tr ibuted share d memory 

loannls Schoinas, Babak Falsafi, Alvin R. Lebeck, Steven K. Reinhardt, James R. Larus, David 
A.Wood 

November 1994 Proceedings of tlie sixtli international conference oh Architectural 

support for programming languages and operating systems, volume 29 , 
28 Issue 11,5 

Full text available: g Ddfd .20 MB) Additional Information: full citation, abstract, references, citings , index 
■ ^ terrns 

This paper discusses implementations of fine-grain memory access control, which 
selectively restricts reads and writes to cache-block-sized memory regions. Fine-grain 
access control forms the basis of efficient cache-coherent shared memory. This paper 
focuses on low-cost implementations that require little or no additional hardware. These 
techniques permit efficient implementation of shared memory on a wide range of parallel 
systems, thereby providing shared-memory codes with a portability ... 

10 ESP: a language for programmable devices 
Sanjeev Kumar, Yitzhak i^landelbaum, Xiang Yu, Kai LI 

May 2001 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2001 conference 

on Programming language design and implementation, volume 36 issue 5 
Full text available- Wl Rdftl 60 MB) Additional Information: full citation , abs t ract , references , citin g s , ind ex 
" " t e r m s 

This paper presents the design and implementation of Event-driven State-machines 
Programming (ESP)— a language for programmable devices. In traditional languages, like C, 
using event-driven state-machine forces a tradeoff that requires giving up ease of 
development and reliability to achieve high performance. ESP is designed to provide all of 
these three properties simultaneously. 

ESP provides a comprehensive set of features to support development of compact and 
modular programs. ... 



Queue management: Persistent dropping: an efficient control of traffic a aa re a ates 
Hani Jamjoom, Kang G. Shin 

August 2003 Proceedings of the 2003 conference on Applications, technologies, 
architectures, and protocols for computer communications 

Full text available: fBpdf(804.16 KB) Additional Information: full citatio n, abstract, r eferences , citings, i ndex 
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to the point of service failure. Traditional rate-based drop policies, such as Random Early 
Drop (RED), become ineffective in such situations since clients tend to be persistent, in the 
sense that they make multiple retransmission attempts before aborting their connection. As 
it is built into TCP's congestion co ... 

Keywords: flash crowd events, modeling, optimization, queue management 



12 Multithreading and value prediction: Specula tive lock elision : e nablin g hig hl y [ 
. concurrent multithreaded execu ti on 

Ravi Rajwar, James R. Goodman 

December 2001 Proceedings of the 34th annual ACM/IEEE international symposium on 
Microarchitecture 

Full text available: ^f,^ o-, ^ 

^mtlL^iXMI^^ Additional Information: full citation , abstract, references , citings 
Publisher Site 

Serialization of threads due to critical sections is a fundamental bottleneck to achieving high 
performance in multithreaded programs. Dynamically, such serialization may be 
unnecessary because these critical sections could have safely executed concurrently without 
locks. Current processors cannot fully exploit such parallelism because they do not have 
mechanisms to dynamically detect such false inter-thread dependences.We propose 
Speculative Lock Elision (SLE), a novel micro-architectura ... 

13 Read ing text from c o mp u t er screens | 
Carol Bergfeld Mills, Linda J. Weldon 

December 1987 ACM Computing Surveys (CSUR), volume i9 issue 4 

Full text available: Pj pdf(3.33 MB) Additional Information: full citati on, abstract, references, dHngs, in_d_ex 

terms , review 

This paper reviews empirical studies concerning the readability of text from computer 
screens. The review focuses on the form and physical attributes of complex, realistic 
displays of text material. I^ost studies comparing paper and computer screen readability 
show that screens are less readable than paper. There are many factors that could affect 
the readability of computer screens. The factors explored in this review are the features of 
characters, the formatting of the screen, the contras .;. 

"■^ E xperien c e w ith a_softwarerdefined jiiachine architecture | 
David W. Wall 

May 1992 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 14 Issue 3 

Full text available: f Q pdf(2.86 MB ) Additional Infomiation: Ml citat ion, abstract, references, citings, index 

^ - - terms , r eview - . 

We have built a system in which the compiler back end and the linker work together to 
present an abstract machine at a considerably higher level than the actual machine. The 
intermediate language translated by the back end is the target language of all high-level 
compilers and is also the only assembly language generally available. This lets us do 
intermodule register allocation, which would be harder if some of the code in the program 
had come from a traditional assembler, out of sight of ... 



Keywords: RISC, graph coloring, intermediate language, interprocedural, optimization, 
pipeline scheduling, profiling, register allocation, register windows 
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Vandercappelle, P. G. Kjeldsberg 

April 2001 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 6 Issue 2 

Full text available: ■ gDdf(339.91 KB^ Additional Information: Mdtatjon, abstract, references. cLtings, index 

We present a survey of the state-of-the-art techniques used In performing data and 
memory-related optimization's in embedded systems. The optimizations are targeted 
directly or indirectly at the nnemory subsystem, and impact one or more. out of three 
important cost metrics: area, performance, and power dissipation of the resulting 
implementation. We first examine architecture-independent optimizations in the form of 
code transoformations. We next cover a broad spectrum of optimizati ... 

Keywords: DRAM, SRAM, address generation, allocation, architecture exploration, code 
transformation, data cache, data optimization, high-level synthesis, memory architecture 
customization, memory power dissipation, register file, size estimation, survey 



Specialization tools and techn i q u es f o r sy stematic optimiz ation of system software 
Dylan McNamee, Jonathan Walpole, Calton Pu, Crispin Cowan, Charles Krasic, Ashvin Gdel, 
Perry Wagle, Charles Consel, Gllles Muller, Renauld Marlet 
May 2001 ACM Transactions on Computer Systems (TOCS), volume i9 issue 2 

Full text available: Wi Rdf(1 78 .52 KB) Additional Information: Mi citation, abstract, r eferences , citings, index 
■ ^ = terms 

Specialization has been recognized as a powerful technique for optimizing operating 
systems. However, specialization has not been broadly applied beyond the research 
community because current techniques based on manual specialization, are time- 
consuming and error-prone. The goal of the work described in this paper is to help 
operating system tuners perform specialization more easily. We have built a specialization 
tooll<it that assists the major tasks of specializing operating systems. We de ... 

Keywords: operating system specialization, optimization, software architecture 



On com pile-time query o ptirnization in deductive databases by means of static filterin g B 
Michael Klfer, Elleze/ L Lozinskii 

September 1990 ACM Transactions on Databaise Systems (TODS), volume is issue 3 

Full text available: S pdf(3.49 MB ^ Additional Information: fuJLcitatiQn, abstract, references, citings, IndKc 
" terms , review 

We extend the query optimization techniques known as algebric manipulations with 
relational expressions [48] to work with deductive databases. In particular, we propose a 
method for moving data-independent selections and projections into recursive axioms, 
which extends all other known techniques for performing that task [2, 3, 9, 18, 20]. We 
also show that, in a well-defined sense, our algorithm is optimal among the algorithms that 
propagate data-independent selections through recursion. 



Keywords: dataflow, deductive databases, filtering, fixpoint, graph representation. 
Inference, projection, recursive rules, selection 
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Data abstraction and information hiding 
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This article describes an approach for verifying programs in the presence of data 
abstraction and information hiding, which are l<ey features of modern programming 
languages with objects and modules. This article draws on our experience building and 
using an automatic program checker, and focuses on the property of modular soundness: 
that IS, the property that the separate verifications of the individual modules of a program 
suffice to ensure the correctness of the composite program. We fo ... 

Keywords: Abstract. variables, abstraction dependencies, extehded static checking 
modifies clauses, modular verification, object-oriented programming, specifications . 



Speciajjssue on persistent object s ystems: Orthoaonallv pe rsistent nhjfirt systems 
Malcolm Atkinson, Ronald Morrison 

July 1995 The VLDB Journal - The International Journal on Very Large Data Bases 

Volume 4 Issue 3 » = , 

Full text available: gMf(5,02M.B) Additional Infonnation: full citat ion, abstract, references, cit i n gs 
Persistent Application Systems (PASs) are of increasing social and economic importance. 
They have the potential to be long-lived, concurrently accessed, and consist of large bodies 
of data and programs. Typical examples of PASs are CAD/CAM systems, office automation, 
CASE tools, software engineering environments, and patient-care support systems in 
hospitals. Orthogonally persistent object systems are intended to provide improved support 
for the design, construction, maintenance, and operation o ... 

Keywords: database programming languages, orthogonal persistence, persistent 
application systems, persistent programming languages 



^° Explgiting^deMj^a^ [ 
Milo M. Martin, Amir Roth, Charlies N. Fischer 

December 1997 Proceedings of the 30th annual ACM/IEEE international symposium on 
Microarchitecture 

Full text available, ig pdf(i jsjMBxS! Additional Information: full citation, abstract , references , citinos . index 
Publisher Site ternns 

We describe Dead Value Information (DVI) and introduce three new optimizations which 
exploit It. DVI provides assertions that certain register values are dead, meaning they will 
not be read before being overwritten. The processor can use DVI to track dead registers 
and dynamically eliminate unnecessary save and restore instructions from the execution 
stream at procedure calls and context switches. Our results indicate that dynamic saves arid 
restore instances can be reduced by 46% for procedure c ... 
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^ BegkteLalJocatjoimci^s.^ 
Vatsa Santhanam, Daryi Odnert 
June 1990 ACM SIGPLAN Notices , Proceedings of the ACI-i SIGPUVN 1990 conference 

on Programming language design and implementation, volume 25 issue 6 
Full text available: gpdfd.SI MB^ Additional information: full citation , abstract , references , citings , index 
'' te r ms 

This paper describes a method for compiling programs using interprocedural register 
allocation. A strategy for handling programs built from multiple modules is presented as 
well as algorithms for global variable promotion and register spill code motion These 
algorithms attempt to address some of the shortcomings of previous interprocedural 
fih^'flSObSdTrc^ite?^'^^" ^^^""^^ ^" implementation on a single register 

^ Optimising hot paths i n a dynamic binarv translator 
David Ung, Cristina Cifuentes 

March 2001 ACIwf SIGARCH Computer Architecture News, volume 29 issue i 

Full text available: g pdf(890.10 KB) Additional Information: full citation , abstract , citings, index terms 

In dynamic binary translation, code is translated "on the fly" at run-time, while the user ' 
perceives ordinary execution of the program on the target machine. Code fragments that 
are frequently executed follow the same sequence of flow control over a period of time 
?hP !fmnr^m^M^l ^""^ Optimised to improve the overall performance of 

the program.Multiple hot paths may also exist in programs. A program may choose to 
execute in one hot path for some time, but later switch to anot ... 

Keywords: binary translation, dynamic compilation, dynamic execution, run time profiling 



3 En^gpo-d Dday Consid^^^^ ^^^^j^ 

Victor Zyuban 

April 2002 Proceedings of the 12th ACM Great Lalces symposium on VLSI 

Full text available: ^,.pdfX109J.8 KB) Additional Information: jyjjcitafion, abstract , refe rence s, citin gs, index 

t erms 

J^n.fl^n^nrr^ °[ power-efflcient microprocessors presents the need to consider power 
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level stages, and the opportunity for making power-performance tradeoffs is the largest. 
Design modifications to the ISA and microarchitecture, however, affect most (if hot all) 
parameters of the design, including architectural speed, co ... 

Keywords: architecture, energy, energy-efficiency, metric, microarchitecture, 
performance, power 



System support for automatic profiling and optimization 

XIaolan Zhang, Zheng Wang, Nicholas Gloy, J. Bradley Chen, Michael D. Smith 

October 1997 ACM SIGOPS Operating Systems Review , Proceedings of tlie sixteentli 

ACM symposium on Operating systems principles, volume 3i issue 5 
Full text avallat}le: ^ pdf<1.95 MB) Additional Information: fu ll citation , references , citings , index term s 



® Building a robust software-based router usin g network processors 
Tammo Spalinl<, Scott Karlin, Larry Peterson, Yitzchak Gottlieb 

October 2001 ACM SIGOPS Operating Systems Review , Proceedings of the eigliteeiitli 

ACM symposium on Operating systems principles, volume 35 issue 5 
Full text available: ■ g|pdf(1.49 MB) Additional Infomiation: M.citation. abstract, references, citings, index 
'^^^^ " " terms 

Recent efforts to add new services to the Internet have increased interest in software-based 
routers that are easy to extend and evolve. This paper describes our experiences using 
emerging network processors— in particular, the Intel IXP1200— to implement a router. We 
show it is possible to combine an IXP1200 development board and a PC to build an 
inexpensive router that forwards minimum-sized packets at a rate of 3.47Mpps. This is 
nearly an order of magnitude faster than existing pure PC-base ... 

s Server per formance and scalability: A method for transparent admission control and g 
r e q uest sclieduling in e-commerce web sites 
Sameh EInikety, Erich IMahum, John Tracey, Willy Zwaenepoel 

May 2004 Proceedings of the 13th international conference on World Wide Web 

Full text available: g.fidf{i79J.8 KB^^ Additional Information: full citation, abs tract , r eferences , index ter ms ■ 

This paper presents a method for admission control and request scheduling for multiply- 
tiered e-commerce Web sites, achieving both stable behavior during overload and improved 
response times. Our method externally observes execution costs of requests online, 
distinguishing different request types, and performs overload protection and preferential 
scheduling using relatively simple measurements and a straight forward control mechanism. 
Unlike previous proposals, which require extensive changes to ... 

Keywords: admission control, dynamic web content, load control, request scheduling, web 
servers 



^ Memory simulators and software generators 
Guillermo Jimenez-Perez, Don Batory 

May 1997 ACM SIGSO FT Software Engineering Notes , Proceedings of the 1997 

symposium on Software reusability, volume 22 issue 3 
Full text available: ■^pdf(1.30 MB) Additional Information: full citation , references , index terms 
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Jaime H. Moreno, Mayan Moudgil 

July 1997 Proceedings of the 11th international conference on Supercomputing 

Full text available: gpdf(1.06 MB) Additional Information: full citation , referenc es. Index t erms 



^ Astro phvsical N-bodv simulations using hier archical tree data st ructures 
M. S. Warren, J. K. Salmon 

December 1992 Proceedings of the 1992 ACi^/IEEE conference on Supercomputing 

Full text available: pdf(709.46 .. KB} Additional Information: full citation , ref erences , citings . Index term s 



1 0 Processors a nd a c celerat ors for embedded applications: The iCOREtm 520 MHz 
sy nth esi zable C PU core • 

Nick Richardson, Lun Bin Huang, Razak Hossain, Tommy Zounes, Naresh Son!, Julian Lewis 
June 2002 Proceedings of the 39th conference on Design automation 

Full text available: ^ pdf(36 0 . 25 KB) Additional Information: f ull cita ti o n, ab s tract , refere n ce s. I ndex te rms 

This paper describes a new implementation of the ST20-C2 CPU architecture. The design 
involves an eight-stage pipeline with hardware support to execute up to three instructions 
in a cycle. Branch prediction is based on a 2-bit predictor scheme with a 1024-entry Branch 
History Table and a 64 entry Branch Target Buffer and a 4-entry Return Stack. The 
implementation of all blocks in the processor was based on synthesized logic generation 
and automatic place.and route. The full design of the CPU fro ... 

Keywords: ASIC, CPU, branch-prediction, cache, embedded, high-frequency, 
microarchitecture, pipeline, st20, synthesis 



Com p iler- directed selection of d ynamic me mory l ayouts 
Mahmut Kandemir, Ismail Kadayif 

April 2001 Proceedings of the ninth international symposium on Hardware/software 



Compiler technology is becoming a key component In the design of embedded systems, 
mostly due to increasing participation of software in the design process. Meeting system- 
level objectives usually requires flexible and retargetable compiler optimizations that can be 
ported across a wide variety of architectures. In particular, source-level compiler 
optimizations aiming at increasing locality of data accesses are expected to improve the 
quality of the generated code. Previous compiler-based ap ... 

Keywords: array reuse, data dependence, data locality, memory layout optimization, 
software compilation 



Dmamjcalyjrading Frequ ency for C omp lexit y in a G A LS Microprocessor 
Steven Dropsho, Greg Semeraro, David H. Albonesi, Grigorios Magklis, Michael L. Scott 
December 2004 Proceedings of the 37th International Symposium on Microarchitecture 

Full text available: 'gpdf(2 09.21 KB) Additional Information: full citation , abstract 

Microprocessors are traditionally designed to provide "best overall" performance across a 
wide range of applications and operating environments. Several arouos have orooosed 
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saving approach: dividing the processor into domains and dynamically changing the clocl< 
frequency and voltage within each domain during phases when the full ... 

13 WaveScalar , H 

Steven Swanson, Ken Michelson, Andrew Schwerin, Mark Oskin 

December 2003 Proceedings of the 36th Annual IEEE/ ACM International Symposium on 
Microarchitecture 

Full text available: ^.BdftmM KB) Additional Information: f ull citat i o n, a bstract, citin gs , i ndex terms 

Silicon technology will continue to provide an exponential increasein the availability of raw 
transistors. Effectively translatingthis resource into application performance, however,is an 
open challenge. Ever increasing wire-delay relativeto switching speed and the exponential- 
cost of circuit complexitymake simply scaling up existing processor designs futile.In this 
paper, we present an alternative to superscalardesign, WaveScalar. WaveScalar is a 
dataflow instructionset architecture and executi ... 



The im pact of architectural tre nds o n o p e ra tin g s ys tem performance Q 
M. Rosenblunfi, E. Bugnlon, S. A. Herrod, E. Witchel, A. Gupta 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth 

ACM symposium on Operating systems principles, volume 29 issue 5 
Full text available: ^jadf (2,03MB). Additional Infomnation: full citation , reference s, citi ngs , in dex term s 



15 Caches and Memory Systems: Heterogeneous memory management for embedded Q 
s yste m s , ' 

Oren Avissar, Rajeev Barua, Dave Stewart 

November 2001 Proceedings of the 2001 international conference on Compilers, 
architecture, and synthesis for embedded systems 

Full text available: pdf(241.12 KB) Additional Infomiation: full citation , abstract , references, citings , index 

terms 

This paper presents a technique for the efficient compiler management of software-exposed 
heterogeneous memory. In many lower-end embedded chips, often used in micro- 
controllers and DSP processors, heterogeneous memory units such as scratch-pad SRAM, ' 
internal DRAM, external DRAM and ROM are visible directly to the software, without 
automatic management by a hardware caching mechanism. Instead the memory units are 
mapped to different portions of the address space. Caches are avoided because of th ... 

Keywords: embedded, heterogeneous, memory, storage 



A simple g raph-based intermediate repre sentation 
Cliff Click, Michael Paleczny 

March 1995 ACM SIGPLAN Notices , Papers from the 1995 ACM SIGPLAN workshop on 

Intermediate representations, volume 30 issue 3 
Full text available: ^Mf{139_.MB) Additional Information: full cita tion, abstract, citings, inde x term s 

We present a graph-based intermediate representation (IR) with simple semantics and a 
low-memory-cost C++ implementation. The IR uses a directed graph with labeled vertices 
and ordered inputs but unordered outputs. Vertices are labeled with opcodes, edges are 
unlabeled. We represent the CFG and basic blocks with the same vertex and edge 
structures. Each opcode is defined by a C++ class that encapsulates opcode-specific data 
and behavior. We use inheritance to abstract common opcode behavior ... 
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M. J. Smith, D. H. Sleeman " 
June 1977 ACM SIGART Bulletin, issue 63 

i Full text available: gMf(687Z7.KB)[ Additional Information: f u l l citation, a bstract , r eferences 

A general purpose Production System interpreter, APRIL, Inas been developed to model 
aspects of students problem solving in a CAI environment. APRIL incorporates most of tlie 
facilities common to earlier interpreters and also includes several additional features. One 
new feature allows "high-level" routines, such as those used by-the interpreter itself in the 
matching and cycling phases, to be actions in production rules: hence It is. possible for the 
user to "create" an interpreter which is suitab ... 



Trap- d ri ven s i mu lation with T apewo r m II 

Richard Uhlig, David Nagle, Trevor l^ludge, Stuart Sechrest 

November 1994 Proceedings of the sixth international conference on Architectural 

support for programming languages and operating systems, volume 29 , 

28 Issue 11,5 

Full text available: « Ddfn.45MB^ Additional Infomiation: full citation, attract, referenceg. cjtings. index 

terms 

Tapeworm II is a software-based simulation tool that evaluates the cache and TLB 
performance of multiple-task and operating system intensive workloads. Tapeworm resides 
in an OS kernel and causes a host machine's hardware to drive simulations with kernel 
traps instead of vyith address traces, as is conventionally done. This allows Tapeworm to 
quickly and accurately capture complete memory referencing behavior with a limited 
degradation in overall system performance. This paper compares trap- ... 
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This paper introduces an architectural simulation environment, aimed at defining the best 
SOC architecture for complex system-level applications. The application is modeled using an 
abstract Timing Modeling Language, that describes the requests (e.g., memory accesses, 
I/Os, etc.) that the application makes to the architecture. The abstract architecture is 
modeled at the cycle-accurate level using a mixture of Esterel (a synchronous language) 
and C. We discuss the results of the application of ... 
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This paper presents the comparison of two methods of system monitoring, passive network 
monitoring and kernel instrumentation. The comparison is made on the basis of passive 
network monitoring being used as a replacement for kernel instrumentation in some 
situations. Despite the fact that the passive network monitoring technique is shown to 
perform poorly as a direct replacement for kernel instrumentation, this paper indicates the 
areas where passive network monitoring could be used to the great ... 
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